DM-DQN: Dueling Munchausen deep Q network for robot path planning
نویسندگان
چکیده
Abstract In order to achieve collision-free path planning in complex environment, Munchausen deep Q-learning network (M-DQN) is applied mobile robot learn the best decision. On basis of Soft-DQN, M-DQN adds scaled log-policy immediate reward. The method allows agent do more exploration. However, algorithm has problem slow convergence. A new and improved (DM-DQN) proposed paper address problem. First, its structure was on by decomposing into a value function an advantage function, thus decoupling action selection evaluation speeding up convergence, giving it better generalization performance enabling decision faster. Second, robot’s trajectory being too close edge obstacle, using artificial potential field set reward drive away from vicinity obstacle. result simulation experiment shows that learns efficiently converges faster than DQN, Dueling DQN both static dynamic environments, able plan paths obstacles.
منابع مشابه
Geometric Concept Acquisition in a Dueling Deep Q-Network
Explaining how intelligent systems come to embody knowledge of deductive concepts through inductive learning is a fundamental challenge of both cognitive science and artificial intelligence. We address this challenge by exploring how a deep reinforcement learning agent, occupying a setting similar to those encountered by early-stage mathematical concept learners, comes to represent ideas such a...
متن کاملDueling Network Architectures for Deep Reinforcement Learning
In recent years there have been many successes of using deep representations in reinforcement learning. Still, many of these applications use conventional architectures, such as convolutional networks, LSTMs, or auto-encoders. In this paper, we present a new neural network architecture for model-free reinforcement learning inspired by advantage learning. Our dueling architecture represents two ...
متن کاملMobile Robot Path Planning Based on Improved Q Learning Algorithm
For path planning of mobile robot, the traditional Q learning algorithm easy to fall into local optimum, slow convergence etc. issues, this paper proposes a new greedy strategy, multi-target searching of Q learning algorithm. Don't need to create the environment model, the mobile robot from a single-target searching transform into multitarget searching an unknown environment, firstly, by the dy...
متن کاملDeep Exploration via Bootstrapped DQN
Efficient exploration in complex environments remains a major challenge for reinforcement learning. We propose bootstrapped DQN, a simple algorithm that explores in a computationally and statistically efficient manner through use of randomized value functions. Unlike dithering strategies such as -greedy exploration, bootstrapped DQN carries out temporally-extended (or deep) exploration; this ca...
متن کاملRobot Path Planning Using Cellular Automata and Genetic Algorithm
In path planning Problems, a complete description of robot geometry, environments and obstacle are presented; the main goal is routing, moving from source to destination, without dealing with obstacles. Also, the existing route should be optimal. The definition of optimality in routing is the same as minimizing the route, in other words, the best possible route to reach the destination. In most...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Complex & Intelligent Systems
سال: 2022
ISSN: ['2198-6053', '2199-4536']
DOI: https://doi.org/10.1007/s40747-022-00948-7